Title Kolmogorov Complexity Estimation and Analysis

نویسنده

  • G. Saulnier
چکیده

Methods for discerning and measuring Kolmogorov Complexity are discussed and their relationships explored. A computationally efficient method of using Lempel Ziv 78 Universal compression algorithm to estimate complexity is introduced. 1 Abstract—Methods for discerning and measuring Kolmogorov Complexity are discussed and their relationships explored. A computationally efficient method of using Lempel Ziv 78 Universal compression algorithm to estimate complexity is introduced. I. INTRODUCTION olmogorov Complexity is a fundamental measure of information with growing applications and importance [2], [4]. Estimation of Kolmogorov Complexity is key to objective information system monitoring and analysis. References [7], [2]-[4] contain many applications of Kolmogorov Complexity; also see [1] for background on this subject. All applications of Kolmogorov Complexity are limited due to its incomputable nature and are impacted by improvements or innovations in the ability to estimate Kolmogorov Complexity well. In this paper we review a generic method for estimating complexity – the Lempel-Ziv 78 (LZ78) [11] universal compression algorithm, discuss its limitations in estimating complexity, and derive a computationally efficient method of using this algorithm to estimate complexity. We then develop two measures for the estimation of complexity – power spectral density based estimation and expected time of sequence production. We discuss the relationships between these methods of estimation and other estimators. Additionally we introduce a third parameter related to complexity – SPAN. Relationships between these measures of complexity are compared and discussed, and their relationships to compression-based estimates of Kolmogorov Complexity are explored.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum complexity density estimation

The minimum complexity or minimum description-length criterion developed by Kolmogorov, Rissanen, Wallace, So&in, and others leads to consistent probability density estimators. These density estimators are defined to achieve the best compromise between likelihood and simplicity. A related issue is the compromise between accuracy of approximations and complexity relative to the sample size. An i...

متن کامل

On The Effectiveness of Kolmogorov Complexity Estimation to Discriminate Semantic Types

— We present progress on the experimental validation of a fundamental and universally applicable vulnerability analysis framework that is capable of identifying new types of vulnerabilities before attackers innovate attacks. This new framework proactively identifies system components that are vulnerable based upon their Kolmogorov Complexity estimates and it facilitates prediction of previously...

متن کامل

Symbol Compression Ratio for String Compression and Estimation of Kolmogorov Complexity

A new compression algorithm is derived that computes and encodes the Minimum Message Length (MML) near optimal partition of symbols in a string for compression. Using Symbol Compression Ratio (SCR) as a driving function this algorithm produces a binary tree model of the data that introduces a fundamental parameter of information related to Kolmogorov Complexity – the size of the alphabet in the...

متن کامل

Statistical Analysis and Model Validation of Gompertz Model on different Real Data Sets for Reliability Modelling

A very basic problem in reliability modeling is to obtain information about the form of the population from which the sample is drawn. Goodness of fit test is employed on different real data sets to determine how well the observed sample data "fits" on proposed model for reliability analysis. In this paper, we obtain the tables and graphs of critical values of Kolmogorov-Smirnov (KS) test, and ...

متن کامل

Applications of Kolmogorov Complexity and Universal Codes to Nonparametric Estimation of Characteristics of Time Series

We consider finite-alphabet and real-valued time series and the following four problems: i) estimation of the (limiting) probability P (x0 . . . xs) for every s and each sequence x0 · · ·xs of letters from the process alphabet (or estimation of the density p(x0, . . . , xs) for real-valued time series), ii) the so-called on-line prediction, where the conditional probability P (xt+1|x1x2 . . . x...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002